The Data

Data

Anscombe’s Quartet
ID  x1  x2  x3  x4     y1    y2     y3     y4
 1  10  10  10   8   8.04  9.14   7.46   6.58
 2   8   8   8   8   6.95  8.14   6.77   5.76
 3  13  13  13   8   7.58  8.74  12.74   7.71
 4   9   9   9   8   8.81  8.77   7.11   8.84
 5  11  11  11   8   8.33  9.26   7.81   8.47
 6  14  14  14   8   9.96  8.10   8.84   7.04
 7   6   6   6   8   7.24  6.13   6.08   5.25
 8   4   4   4  19   4.26  3.10   5.39  12.50
 9  12  12  12   8  10.84  9.13   8.15   5.56
10   7   7   7   8   4.82  7.26   6.42   7.91
11   5   5   5   8   5.68  4.74   5.73   6.89

Data Summary

  • Four pairs of {x, y} variables:
    {x1, y1} to {x4, y4}

  • 11 rows

  • Published by Francis J. Anscombe (1973):
    Graphs in statistical analysis.
    The American Statistician, 27, 17–21

Descriptive Stats

Means

Means of x and y
Set x y
1 9 7.500909
2 9 7.500909
3 9 7.500000
4 9 7.500909

Standard Deviations

Standard Deviations of x and y
Set x y
1 3.316625 2.031568
2 3.316625 2.031657
3 3.316625 2.030424
4 3.316625 2.030578
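
The means and standard deviations can be recomputed from the data table with Python's standard library. This is a minimal sketch, not the code that produced the dashboard: the names `x123`, `quartet`, and `summary` are illustrative assumptions, and `statistics.stdev` is the sample standard deviation (n − 1 denominator), which is assumed to match the values shown here.

```python
from statistics import mean, stdev

# Data transcribed from the Anscombe's Quartet table.
# Sets 1 to 3 share the same x values.
x123 = [10, 8, 13, 9, 11, 14, 6, 4, 12, 7, 5]
quartet = {
    1: (x123, [8.04, 6.95, 7.58, 8.81, 8.33, 9.96, 7.24, 4.26, 10.84, 4.82, 5.68]),
    2: (x123, [9.14, 8.14, 8.74, 8.77, 9.26, 8.10, 6.13, 3.10, 9.13, 7.26, 4.74]),
    3: (x123, [7.46, 6.77, 12.74, 7.11, 7.81, 8.84, 6.08, 5.39, 8.15, 6.42, 5.73]),
    4: ([8, 8, 8, 8, 8, 8, 8, 19, 8, 8, 8],
        [6.58, 5.76, 7.71, 8.84, 8.47, 7.04, 5.25, 12.50, 5.56, 7.91, 6.89]),
}

# One row of summary statistics per set.
summary = {
    s: {"mean_x": mean(x), "mean_y": mean(y), "sd_x": stdev(x), "sd_y": stdev(y)}
    for s, (x, y) in quartet.items()
}

for s, row in summary.items():
    print(s, *(round(v, 6) for v in row.values()))
```

The four sets agree on the mean and standard deviation of x exactly and on those of y to about two decimal places.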

Correlation and Regression

Correlations

Correlations between x and y
Set cor
1 0.8164205
2 0.8162365
3 0.8162867
4 0.8165214
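
The correlations can be checked with a hand-written Pearson coefficient. The sketch below assumes the table reports the ordinary Pearson product-moment correlation; the helper `pearson` and the names `quartet` and `cors` are illustrative, not taken from the dashboard's source.

```python
from math import sqrt

# Data transcribed from the Anscombe's Quartet table.
x123 = [10, 8, 13, 9, 11, 14, 6, 4, 12, 7, 5]
quartet = {
    1: (x123, [8.04, 6.95, 7.58, 8.81, 8.33, 9.96, 7.24, 4.26, 10.84, 4.82, 5.68]),
    2: (x123, [9.14, 8.14, 8.74, 8.77, 9.26, 8.10, 6.13, 3.10, 9.13, 7.26, 4.74]),
    3: (x123, [7.46, 6.77, 12.74, 7.11, 7.81, 8.84, 6.08, 5.39, 8.15, 6.42, 5.73]),
    4: ([8, 8, 8, 8, 8, 8, 8, 19, 8, 8, 8],
        [6.58, 5.76, 7.71, 8.84, 8.47, 7.04, 5.25, 12.50, 5.56, 7.91, 6.89]),
}


def pearson(xs, ys):
    """Pearson correlation: covariance scaled by both spreads."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


cors = {s: pearson(x, y) for s, (x, y) in quartet.items()}
for s, r in cors.items():
    print(s, round(r, 7))
```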

Regression: y ~ x

Regression coefficients (y = intercept + slope × x)
Set (Intercept) x (slope)
1 3.000091 0.5000909
2 3.000909 0.5000000
3 3.002454 0.4997273
4 3.001727 0.4999091
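
The coefficients can be reproduced with the closed-form ordinary least squares solution for a single predictor: slope = Sxy / Sxx and intercept = mean(y) − slope × mean(x). This is a sketch under the assumption that the table comes from a plain least-squares fit of y on x; `fit_line`, `quartet`, and `fits` are illustrative names.

```python
# Data transcribed from the Anscombe's Quartet table.
x123 = [10, 8, 13, 9, 11, 14, 6, 4, 12, 7, 5]
quartet = {
    1: (x123, [8.04, 6.95, 7.58, 8.81, 8.33, 9.96, 7.24, 4.26, 10.84, 4.82, 5.68]),
    2: (x123, [9.14, 8.14, 8.74, 8.77, 9.26, 8.10, 6.13, 3.10, 9.13, 7.26, 4.74]),
    3: (x123, [7.46, 6.77, 12.74, 7.11, 7.81, 8.84, 6.08, 5.39, 8.15, 6.42, 5.73]),
    4: ([8, 8, 8, 8, 8, 8, 8, 19, 8, 8, 8],
        [6.58, 5.76, 7.71, 8.84, 8.47, 7.04, 5.25, 12.50, 5.56, 7.91, 6.89]),
}


def fit_line(xs, ys):
    """Return (intercept, slope) of the least-squares line y = a + b * x."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = my - slope * mx
    return intercept, slope


fits = {s: fit_line(x, y) for s, (x, y) in quartet.items()}
for s, (a, b) in fits.items():
    print(f"Set {s}: y = {a:.6f} + {b:.7f} * x")
```

All four fits come out close to y = 3 + 0.5x, even though the underlying point patterns differ.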

Visualizations

[Plots: x1 and y1; x2 and y2; x3 and y3; x4 and y4; Overview: All plots]
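
As a rough text-only stand-in for the plots, each set can be drawn as a character-grid scatter. This is only a sketch: the function `ascii_scatter`, the grid size, and the fixed axis ranges (x from 0 to 20, y from 0 to 14) are assumptions chosen for illustration, and points that fall into the same cell are drawn as a single `*`.

```python
# Data transcribed from the Anscombe's Quartet table.
x123 = [10, 8, 13, 9, 11, 14, 6, 4, 12, 7, 5]
quartet = {
    1: (x123, [8.04, 6.95, 7.58, 8.81, 8.33, 9.96, 7.24, 4.26, 10.84, 4.82, 5.68]),
    2: (x123, [9.14, 8.14, 8.74, 8.77, 9.26, 8.10, 6.13, 3.10, 9.13, 7.26, 4.74]),
    3: (x123, [7.46, 6.77, 12.74, 7.11, 7.81, 8.84, 6.08, 5.39, 8.15, 6.42, 5.73]),
    4: ([8, 8, 8, 8, 8, 8, 8, 19, 8, 8, 8],
        [6.58, 5.76, 7.71, 8.84, 8.47, 7.04, 5.25, 12.50, 5.56, 7.91, 6.89]),
}


def ascii_scatter(xs, ys, width=40, height=12, x_range=(0, 20), y_range=(0, 14)):
    """Render points as '*' on a width x height grid with shared axis ranges."""
    grid = [[" "] * width for _ in range(height)]
    x_lo, x_hi = x_range
    y_lo, y_hi = y_range
    for x, y in zip(xs, ys):
        col = round((x - x_lo) / (x_hi - x_lo) * (width - 1))
        row = round((y - y_lo) / (y_hi - y_lo) * (height - 1))
        grid[height - 1 - row][col] = "*"  # row 0 of the grid is the top
    body = "\n".join("|" + "".join(line) for line in grid)
    return body + "\n+" + "-" * width


for s, (x, y) in quartet.items():
    print(f"x{s} and y{s}")
    print(ascii_scatter(x, y))
    print()
```

Using the same axis ranges for every panel keeps the four sets visually comparable, which is the point of viewing them side by side.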